Integrating CAT and MT in AnglaBharti-II Architecture
Abstract
Machine translation (MT) is a complex and difficult task, and the present state of technology cannot match human performance. Automating the translation of natural languages requires a number of knowledge sources and their appropriate invocation in the translation engine. A practical MT system with limited resources cannot embody all the knowledge sources that human beings use. However, the performance of an MT system can be improved considerably if it is integrated with supporting modules that work in synergy to arrive at correct translations. Computer-assisted tools (CAT) that identify the limitations of the MT system and provide clues for coping with them constitute an important module for enhancing MT system performance. This paper presents the details of the AnglaBharti-II system architecture, highlighting the role of CAT in the system. AnglaBharti-II is a system for translating English into Indian languages. AnglaBharti is primarily a rule-based MT (RBMT) system. The input English sentence is transformed into a pseudo-interlingual structure called PLIL (Pseudo Lingua for Indian Languages) using a CFG-like, pattern-directed rule base. RBMT has limitations of its own in dealing with real-life texts. We have tried to overcome some of these limitations in the AnglaBharti-II architecture by integrating additional modules. These modules are essentially CAT tools that incorporate translation memory, raw and generalized example bases, interactive and automated pre-editing, paraphrasing, failure analysis, and a number of heuristics that attempt to deal with a variety of constructs frequently encountered in real-life English text.
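The abstract does not specify how the CAT modules and the rule-based engine are wired together, so the following is only a minimal sketch of one plausible arrangement: a translation memory is consulted first, the rule-based engine is tried next, and sentences that neither can handle are recorded for later failure analysis. Every name here (`HybridTranslator`, `toy_rule_engine`, `normalize`) is hypothetical and is not part of AnglaBharti-II, and the `TGT(...)` strings are placeholders rather than real translations.

```python
from typing import Callable, Dict, List, Optional, Tuple

# Hypothetical interface: a rule engine returns a translation,
# or None when no rule covers the input sentence.
RuleEngine = Callable[[str], Optional[str]]


def normalize(sentence: str) -> str:
    """Lowercase and collapse whitespace so trivially different inputs match."""
    return " ".join(sentence.lower().split())


class HybridTranslator:
    """Illustrative pipeline: translation memory, then rules, then failure log."""

    def __init__(self, rule_engine: RuleEngine) -> None:
        self.rule_engine = rule_engine
        self.memory: Dict[str, str] = {}   # normalized source -> stored target
        self.failures: List[str] = []      # inputs kept for failure analysis

    def remember(self, source: str, target: str) -> None:
        """Store a confirmed translation in the translation memory."""
        self.memory[normalize(source)] = target

    def translate(self, sentence: str) -> Tuple[Optional[str], str]:
        """Return (translation, name of the module that produced it)."""
        key = normalize(sentence)
        if key in self.memory:
            return self.memory[key], "translation_memory"
        result = self.rule_engine(sentence)
        if result is not None:
            return result, "rule_engine"
        self.failures.append(sentence)
        return None, "failed"


def toy_rule_engine(sentence: str) -> Optional[str]:
    """Stand-in for a real rule base: it covers a single input only."""
    coverage = {"hello": "TGT(hello)"}  # placeholder output, not a real translation
    return coverage.get(normalize(sentence))
```

In this sketch the failure log plays the role of an input to failure analysis or interactive pre-editing: a human could rephrase a logged sentence and add the confirmed result back through `remember`.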